Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 396030 |
| Missing cells | 81590 |
| Missing cells (%) | 0.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 339.4 MiB |
| Average record size in memory | 898.7 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 10 |
| Text | 3 |
| DateTime | 2 |
dti is highly overall correlated with emp_length | High correlation |
emp_length is highly overall correlated with dti | High correlation |
grade is highly overall correlated with int_rate and 1 other fields | High correlation |
installment is highly overall correlated with loan_amnt | High correlation |
int_rate is highly overall correlated with grade and 1 other fields | High correlation |
loan_amnt is highly overall correlated with installment | High correlation |
open_acc is highly overall correlated with total_acc | High correlation |
pub_rec is highly overall correlated with pub_rec_bankruptcies | High correlation |
pub_rec_bankruptcies is highly overall correlated with pub_rec | High correlation |
sub_grade is highly overall correlated with grade and 1 other fields | High correlation |
total_acc is highly overall correlated with open_acc | High correlation |
application_type is highly imbalanced (98.7%) | Imbalance |
emp_title has 22927 (5.8%) missing values | Missing |
emp_length has 18301 (4.6%) missing values | Missing |
mort_acc has 37795 (9.5%) missing values | Missing |
annual_inc is highly skewed (γ1 = 41.04272475) | Skewed |
dti is highly skewed (γ1 = 431.0512254) | Skewed |
pub_rec has 338272 (85.4%) zeros | Zeros |
mort_acc has 139777 (35.3%) zeros | Zeros |
pub_rec_bankruptcies has 350380 (88.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-10-06 21:24:54.308203 |
|---|---|
| Analysis finished | 2024-10-06 21:25:25.795716 |
| Duration | 31.49 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
loan_amnt
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1397 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14113.888 |
| Minimum | 500 |
|---|---|
| Maximum | 40000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 3250 |
| Q1 | 8000 |
| median | 12000 |
| Q3 | 20000 |
| 95-th percentile | 30975 |
| Maximum | 40000 |
| Range | 39500 |
| Interquartile range (IQR) | 12000 |
Descriptive statistics
| Standard deviation | 8357.4413 |
|---|---|
| Coefficient of variation (CV) | 0.59214309 |
| Kurtosis | -0.062597535 |
| Mean | 14113.888 |
| Median Absolute Deviation (MAD) | 5500 |
| Skewness | 0.77728547 |
| Sum | 5.5895231 × 109 |
| Variance | 69846826 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 27668 | 7.0% |
| 12000 | 21366 | 5.4% |
| 15000 | 19903 | 5.0% |
| 20000 | 18969 | 4.8% |
| 35000 | 14576 | 3.7% |
| 8000 | 13539 | 3.4% |
| 6000 | 12734 | 3.2% |
| 5000 | 12443 | 3.1% |
| 16000 | 10129 | 2.6% |
| 18000 | 9195 | 2.3% |
| Other values (1387) | 235508 |
| Value | Count | Frequency (%) |
| 500 | 4 | < 0.1% |
| 700 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 900 | 1 | < 0.1% |
| 950 | 1 | < 0.1% |
| 1000 | 1448 | |
| 1025 | 4 | < 0.1% |
| 1050 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 40000 | 180 | |
| 39700 | 1 | < 0.1% |
| 39600 | 1 | < 0.1% |
| 39500 | 1 | < 0.1% |
| 39475 | 1 | < 0.1% |
| 39200 | 1 | < 0.1% |
| 38825 | 1 | < 0.1% |
| 38750 | 1 | < 0.1% |
| 38475 | 1 | < 0.1% |
| 38300 | 1 | < 0.1% |
term
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.3 MiB |
| 36 months | |
|---|---|
| 60 months |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 3960300 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 months |
|---|---|
| 2nd row | 36 months |
| 3rd row | 36 months |
| 4th row | 36 months |
| 5th row | 60 months |
Common Values
| Value | Count | Frequency (%) |
| 36 months | 302005 | |
| 60 months | 94025 | 23.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| months | 396030 | |
| 36 | 302005 | |
| 60 | 94025 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 792060 | ||
| 6 | 396030 | |
| t | 396030 | |
| m | 396030 | |
| o | 396030 | |
| n | 396030 | |
| s | 396030 | |
| h | 396030 | |
| 3 | 302005 | 7.6% |
| 0 | 94025 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3960300 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 792060 | ||
| 6 | 396030 | |
| t | 396030 | |
| m | 396030 | |
| o | 396030 | |
| n | 396030 | |
| s | 396030 | |
| h | 396030 | |
| 3 | 302005 | 7.6% |
| 0 | 94025 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3960300 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 792060 | ||
| 6 | 396030 | |
| t | 396030 | |
| m | 396030 | |
| o | 396030 | |
| n | 396030 | |
| s | 396030 | |
| h | 396030 | |
| 3 | 302005 | 7.6% |
| 0 | 94025 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3960300 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 792060 | ||
| 6 | 396030 | |
| t | 396030 | |
| m | 396030 | |
| o | 396030 | |
| n | 396030 | |
| s | 396030 | |
| h | 396030 | |
| 3 | 302005 | 7.6% |
| 0 | 94025 | 2.4% |
int_rate
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 566 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.6394 |
| Minimum | 5.32 |
|---|---|
| Maximum | 30.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 5.32 |
|---|---|
| 5-th percentile | 6.89 |
| Q1 | 10.49 |
| median | 13.33 |
| Q3 | 16.49 |
| 95-th percentile | 21.97 |
| Maximum | 30.99 |
| Range | 25.67 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.4721574 |
|---|---|
| Coefficient of variation (CV) | 0.3278852 |
| Kurtosis | -0.14394654 |
| Mean | 13.6394 |
| Median Absolute Deviation (MAD) | 3.08 |
| Skewness | 0.42066947 |
| Sum | 5401611.6 |
| Variance | 20.000192 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10.99 | 12411 | 3.1% |
| 12.99 | 9632 | 2.4% |
| 15.61 | 9350 | 2.4% |
| 11.99 | 8582 | 2.2% |
| 8.9 | 8019 | 2.0% |
| 12.12 | 7358 | 1.9% |
| 7.9 | 7332 | 1.9% |
| 16.29 | 6632 | 1.7% |
| 13.11 | 6580 | 1.7% |
| 6.03 | 6291 | 1.6% |
| Other values (556) | 313843 |
| Value | Count | Frequency (%) |
| 5.32 | 2440 | 0.6% |
| 5.42 | 465 | 0.1% |
| 5.79 | 333 | 0.1% |
| 5.93 | 431 | 0.1% |
| 5.99 | 278 | 0.1% |
| 6 | 70 | < 0.1% |
| 6.03 | 6291 | |
| 6.17 | 220 | 0.1% |
| 6.24 | 1184 | 0.3% |
| 6.39 | 656 | 0.2% |
| Value | Count | Frequency (%) |
| 30.99 | 13 | |
| 30.94 | 3 | < 0.1% |
| 30.89 | 3 | < 0.1% |
| 30.84 | 1 | < 0.1% |
| 30.79 | 9 | |
| 30.74 | 4 | < 0.1% |
| 30.49 | 5 | < 0.1% |
| 29.99 | 7 | |
| 29.96 | 8 | |
| 29.67 | 15 |
installment
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 55706 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 431.8497 |
| Minimum | 16.08 |
|---|---|
| Maximum | 1533.81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 16.08 |
|---|---|
| 5-th percentile | 109.51 |
| Q1 | 250.33 |
| median | 375.43 |
| Q3 | 567.3 |
| 95-th percentile | 925.6 |
| Maximum | 1533.81 |
| Range | 1517.73 |
| Interquartile range (IQR) | 316.97 |
Descriptive statistics
| Standard deviation | 250.72779 |
|---|---|
| Coefficient of variation (CV) | 0.5805904 |
| Kurtosis | 0.78381992 |
| Mean | 431.8497 |
| Median Absolute Deviation (MAD) | 150.5 |
| Skewness | 0.98359816 |
| Sum | 1.7102544 × 108 |
| Variance | 62864.424 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 327.34 | 968 | 0.2% |
| 332.1 | 791 | 0.2% |
| 491.01 | 736 | 0.2% |
| 336.9 | 686 | 0.2% |
| 392.81 | 683 | 0.2% |
| 332.72 | 641 | 0.2% |
| 337.47 | 624 | 0.2% |
| 317.54 | 574 | 0.1% |
| 654.68 | 556 | 0.1% |
| 261.88 | 527 | 0.1% |
| Other values (55696) | 389244 |
| Value | Count | Frequency (%) |
| 16.08 | 1 | |
| 16.25 | 1 | |
| 16.31 | 1 | |
| 16.47 | 1 | |
| 19.87 | 1 | |
| 20.22 | 1 | |
| 21.25 | 1 | |
| 21.62 | 1 | |
| 21.99 | 1 | |
| 22.24 | 1 |
| Value | Count | Frequency (%) |
| 1533.81 | 1 | |
| 1527 | 1 | |
| 1503.85 | 1 | |
| 1479.49 | 1 | |
| 1464.42 | 1 | |
| 1458.25 | 1 | |
| 1451.14 | 2 | |
| 1451.12 | 2 | |
| 1445.9 | 1 | |
| 1443.76 | 1 |
grade
Categorical
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.9 MiB |
| B | |
|---|---|
| C | |
| A | |
| D | |
| E | |
| Other values (2) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 396030 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | B |
| 3rd row | B |
| 4th row | A |
| 5th row | C |
Common Values
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 8.0% |
| F | 11772 | 3.0% |
| G | 3054 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| b | 116018 | |
| c | 105987 | |
| a | 64187 | |
| d | 63524 | |
| e | 31488 | 8.0% |
| f | 11772 | 3.0% |
| g | 3054 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 8.0% |
| F | 11772 | 3.0% |
| G | 3054 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 396030 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 8.0% |
| F | 11772 | 3.0% |
| G | 3054 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 396030 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 8.0% |
| F | 11772 | 3.0% |
| G | 3054 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 396030 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 8.0% |
| F | 11772 | 3.0% |
| G | 3054 | 0.8% |
sub_grade
Categorical
HIGH CORRELATION 
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.3 MiB |
| B3 | 26655 |
|---|---|
| B4 | 25601 |
| C1 | 23662 |
| C2 | 22580 |
| B2 | 22495 |
| Other values (30) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 792060 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B4 |
|---|---|
| 2nd row | B5 |
| 3rd row | B3 |
| 4th row | A2 |
| 5th row | C5 |
Common Values
| Value | Count | Frequency (%) |
| B3 | 26655 | 6.7% |
| B4 | 25601 | 6.5% |
| C1 | 23662 | 6.0% |
| C2 | 22580 | 5.7% |
| B2 | 22495 | 5.7% |
| B5 | 22085 | 5.6% |
| C3 | 21221 | 5.4% |
| C4 | 20280 | 5.1% |
| B1 | 19182 | 4.8% |
| A5 | 18526 | 4.7% |
| Other values (25) | 173743 |
Length
| Value | Count | Frequency (%) |
| b3 | 26655 | 6.7% |
| b4 | 25601 | 6.5% |
| c1 | 23662 | 6.0% |
| c2 | 22580 | 5.7% |
| b2 | 22495 | 5.7% |
| b5 | 22085 | 5.6% |
| c3 | 21221 | 5.4% |
| c4 | 20280 | 5.1% |
| b1 | 19182 | 4.8% |
| a5 | 18526 | 4.7% |
| Other values (25) | 173743 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| 1 | 81077 | |
| 4 | 80849 | |
| 3 | 79720 | |
| 2 | 79544 | |
| 5 | 74840 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 4.0% |
| Other values (2) | 14826 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 792060 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| 1 | 81077 | |
| 4 | 80849 | |
| 3 | 79720 | |
| 2 | 79544 | |
| 5 | 74840 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 4.0% |
| Other values (2) | 14826 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 792060 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| 1 | 81077 | |
| 4 | 80849 | |
| 3 | 79720 | |
| 2 | 79544 | |
| 5 | 74840 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 4.0% |
| Other values (2) | 14826 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 792060 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 116018 | |
| C | 105987 | |
| 1 | 81077 | |
| 4 | 80849 | |
| 3 | 79720 | |
| 2 | 79544 | |
| 5 | 74840 | |
| A | 64187 | |
| D | 63524 | |
| E | 31488 | 4.0% |
| Other values (2) | 14826 | 1.9% |
emp_title
Text
MISSING 
| Distinct | 173105 |
|---|---|
| Distinct (%) | 46.4% |
| Missing | 22927 |
| Missing (%) | 5.8% |
| Memory size | 24.0 MiB |
Length
| Max length | 78 |
|---|---|
| Median length | 56 |
| Mean length | 16.586736 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6188561 |
|---|---|
| Distinct characters | 125 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 145247 ? |
|---|---|
| Unique (%) | 38.9% |
Sample
| 1st row | Marketing |
|---|---|
| 2nd row | Credit analyst |
| 3rd row | Statistician |
| 4th row | Client Advocate |
| 5th row | Destiny Management Inc. |
| Value | Count | Frequency (%) |
| manager | 39270 | 4.7% |
| of | 15802 | 1.9% |
| inc | 10469 | 1.2% |
| director | 9837 | 1.2% |
| sales | 9635 | 1.1% |
| assistant | 9259 | 1.1% |
| analyst | 7652 | 0.9% |
| specialist | 7627 | 0.9% |
| supervisor | 7501 | 0.9% |
| engineer | 7462 | 0.9% |
| Other values (55359) | 717784 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 606206 | 9.8% |
| 487836 | 7.9% | |
| r | 470449 | 7.6% |
| a | 455384 | 7.4% |
| i | 406094 | 6.6% |
| n | 405205 | 6.5% |
| t | 373457 | 6.0% |
| o | 330975 | 5.3% |
| s | 293945 | 4.7% |
| c | 244175 | 3.9% |
| Other values (115) | 2114835 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6188561 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 606206 | 9.8% |
| 487836 | 7.9% | |
| r | 470449 | 7.6% |
| a | 455384 | 7.4% |
| i | 406094 | 6.6% |
| n | 405205 | 6.5% |
| t | 373457 | 6.0% |
| o | 330975 | 5.3% |
| s | 293945 | 4.7% |
| c | 244175 | 3.9% |
| Other values (115) | 2114835 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6188561 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 606206 | 9.8% |
| 487836 | 7.9% | |
| r | 470449 | 7.6% |
| a | 455384 | 7.4% |
| i | 406094 | 6.6% |
| n | 405205 | 6.5% |
| t | 373457 | 6.0% |
| o | 330975 | 5.3% |
| s | 293945 | 4.7% |
| c | 244175 | 3.9% |
| Other values (115) | 2114835 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6188561 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 606206 | 9.8% |
| 487836 | 7.9% | |
| r | 470449 | 7.6% |
| a | 455384 | 7.4% |
| i | 406094 | 6.6% |
| n | 405205 | 6.5% |
| t | 373457 | 6.0% |
| o | 330975 | 5.3% |
| s | 293945 | 4.7% |
| c | 244175 | 3.9% |
| Other values (115) | 2114835 |
emp_length
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18301 |
| Missing (%) | 4.6% |
| Memory size | 21.4 MiB |
| 10+ years | |
|---|---|
| 2 years | |
| < 1 year | |
| 3 years | |
| 5 years | |
| Other values (6) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.6828308 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2902028 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10+ years |
|---|---|
| 2nd row | 4 years |
| 3rd row | < 1 year |
| 4th row | 6 years |
| 5th row | 9 years |
Common Values
| Value | Count | Frequency (%) |
| 10+ years | 126041 | |
| 2 years | 35827 | 9.0% |
| < 1 year | 31725 | 8.0% |
| 3 years | 31665 | 8.0% |
| 5 years | 26495 | 6.7% |
| 1 year | 25882 | 6.5% |
| 4 years | 23952 | 6.0% |
| 6 years | 20841 | 5.3% |
| 7 years | 20819 | 5.3% |
| 8 years | 19168 | 4.8% |
| (Missing) | 18301 | 4.6% |
Length
| Value | Count | Frequency (%) |
| years | 320122 | |
| 10 | 126041 | 16.0% |
| 1 | 57607 | 7.3% |
| year | 57607 | 7.3% |
| 2 | 35827 | 4.6% |
| 31725 | 4.0% | |
| 3 | 31665 | 4.0% |
| 5 | 26495 | 3.4% |
| 4 | 23952 | 3.0% |
| 6 | 20841 | 2.6% |
| Other values (3) | 55301 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 409454 | ||
| y | 377729 | |
| r | 377729 | |
| a | 377729 | |
| e | 377729 | |
| s | 320122 | |
| 1 | 183648 | |
| 0 | 126041 | 4.3% |
| + | 126041 | 4.3% |
| 2 | 35827 | 1.2% |
| Other values (8) | 189979 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2902028 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 409454 | ||
| y | 377729 | |
| r | 377729 | |
| a | 377729 | |
| e | 377729 | |
| s | 320122 | |
| 1 | 183648 | |
| 0 | 126041 | 4.3% |
| + | 126041 | 4.3% |
| 2 | 35827 | 1.2% |
| Other values (8) | 189979 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2902028 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 409454 | ||
| y | 377729 | |
| r | 377729 | |
| a | 377729 | |
| e | 377729 | |
| s | 320122 | |
| 1 | 183648 | |
| 0 | 126041 | 4.3% |
| + | 126041 | 4.3% |
| 2 | 35827 | 1.2% |
| Other values (8) | 189979 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2902028 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 409454 | ||
| y | 377729 | |
| r | 377729 | |
| a | 377729 | |
| e | 377729 | |
| s | 320122 | |
| 1 | 183648 | |
| 0 | 126041 | 4.3% |
| + | 126041 | 4.3% |
| 2 | 35827 | 1.2% |
| Other values (8) | 189979 |
home_ownership
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.7 MiB |
| MORTGAGE | |
|---|---|
| RENT | |
| OWN | |
| OTHER | 112 |
| NONE | 31 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 5.9083277 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2339875 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | MORTGAGE |
| 3rd row | RENT |
| 4th row | RENT |
| 5th row | MORTGAGE |
Common Values
| Value | Count | Frequency (%) |
| MORTGAGE | 198348 | |
| RENT | 159790 | |
| OWN | 37746 | 9.5% |
| OTHER | 112 | < 0.1% |
| NONE | 31 | < 0.1% |
| ANY | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mortgage | 198348 | |
| rent | 159790 | |
| own | 37746 | 9.5% |
| other | 112 | < 0.1% |
| none | 31 | < 0.1% |
| any | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 396696 | |
| E | 358281 | |
| R | 358250 | |
| T | 358250 | |
| O | 236237 | |
| A | 198351 | |
| M | 198348 | |
| N | 197601 | |
| W | 37746 | 1.6% |
| H | 112 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2339875 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| G | 396696 | |
| E | 358281 | |
| R | 358250 | |
| T | 358250 | |
| O | 236237 | |
| A | 198351 | |
| M | 198348 | |
| N | 197601 | |
| W | 37746 | 1.6% |
| H | 112 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2339875 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| G | 396696 | |
| E | 358281 | |
| R | 358250 | |
| T | 358250 | |
| O | 236237 | |
| A | 198351 | |
| M | 198348 | |
| N | 197601 | |
| W | 37746 | 1.6% |
| H | 112 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2339875 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| G | 396696 | |
| E | 358281 | |
| R | 358250 | |
| T | 358250 | |
| O | 236237 | |
| A | 198351 | |
| M | 198348 | |
| N | 197601 | |
| W | 37746 | 1.6% |
| H | 112 | < 0.1% |
annual_inc
Real number (ℝ)
SKEWED 
| Distinct | 27197 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74203.176 |
| Minimum | 0 |
|---|---|
| Maximum | 8706582 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 28000 |
| Q1 | 45000 |
| median | 64000 |
| Q3 | 90000 |
| 95-th percentile | 150000 |
| Maximum | 8706582 |
| Range | 8706582 |
| Interquartile range (IQR) | 45000 |
Descriptive statistics
| Standard deviation | 61637.621 |
|---|---|
| Coefficient of variation (CV) | 0.83066015 |
| Kurtosis | 4238.5506 |
| Mean | 74203.176 |
| Median Absolute Deviation (MAD) | 21000 |
| Skewness | 41.042725 |
| Sum | 2.9386684 × 1010 |
| Variance | 3.7991963 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60000 | 15313 | 3.9% |
| 50000 | 13303 | 3.4% |
| 65000 | 11333 | 2.9% |
| 70000 | 10674 | 2.7% |
| 40000 | 10629 | 2.7% |
| 45000 | 10114 | 2.6% |
| 80000 | 9971 | 2.5% |
| 75000 | 9850 | 2.5% |
| 55000 | 9195 | 2.3% |
| 90000 | 7573 | 1.9% |
| Other values (27187) | 288075 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 600 | 1 | < 0.1% |
| 2500 | 1 | < 0.1% |
| 4000 | 2 | < 0.1% |
| 4080 | 1 | < 0.1% |
| 4200 | 1 | < 0.1% |
| 4524 | 1 | < 0.1% |
| 4800 | 6 | |
| 4888 | 1 | < 0.1% |
| 5000 | 3 |
| Value | Count | Frequency (%) |
| 8706582 | 1 | |
| 7600000 | 1 | |
| 7446395 | 1 | |
| 7141778 | 1 | |
| 7000000 | 1 | |
| 6500000 | 1 | |
| 6100000 | 1 | |
| 6000000 | 2 | |
| 5000000 | 1 | |
| 4900000 | 1 |
verification_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.9 MiB |
| Verified | |
|---|---|
| Source Verified | |
| Not Verified |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 11.585645 |
| Min length | 8 |
Characters and Unicode
| Total characters | 4588263 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Verified |
|---|---|
| 2nd row | Not Verified |
| 3rd row | Source Verified |
| 4th row | Not Verified |
| 5th row | Verified |
Common Values
| Value | Count | Frequency (%) |
| Verified | 139563 | |
| Source Verified | 131385 | |
| Not Verified | 125082 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| verified | 396030 | |
| source | 131385 | 20.1% |
| not | 125082 | 19.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 923445 | |
| i | 792060 | |
| r | 527415 | |
| V | 396030 | |
| f | 396030 | |
| d | 396030 | |
| o | 256467 | 5.6% |
| 256467 | 5.6% | |
| S | 131385 | 2.9% |
| u | 131385 | 2.9% |
| Other values (3) | 381549 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4588263 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 923445 | |
| i | 792060 | |
| r | 527415 | |
| V | 396030 | |
| f | 396030 | |
| d | 396030 | |
| o | 256467 | 5.6% |
| 256467 | 5.6% | |
| S | 131385 | 2.9% |
| u | 131385 | 2.9% |
| Other values (3) | 381549 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4588263 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 923445 | |
| i | 792060 | |
| r | 527415 | |
| V | 396030 | |
| f | 396030 | |
| d | 396030 | |
| o | 256467 | 5.6% |
| 256467 | 5.6% | |
| S | 131385 | 2.9% |
| u | 131385 | 2.9% |
| Other values (3) | 381549 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4588263 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 923445 | |
| i | 792060 | |
| r | 527415 | |
| V | 396030 | |
| f | 396030 | |
| d | 396030 | |
| o | 256467 | 5.6% |
| 256467 | 5.6% | |
| S | 131385 | 2.9% |
| u | 131385 | 2.9% |
| Other values (3) | 381549 |
issue_d
Date
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Minimum | 2007-06-01 00:00:00 |
|---|---|
| Maximum | 2016-12-01 00:00:00 |
loan_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.4 MiB |
| Fully Paid | |
|---|---|
| Charged Off |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.196129 |
| Min length | 10 |
Characters and Unicode
| Total characters | 4037973 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fully Paid |
|---|---|
| 2nd row | Fully Paid |
| 3rd row | Fully Paid |
| 4th row | Fully Paid |
| 5th row | Charged Off |
Common Values
| Value | Count | Frequency (%) |
| Fully Paid | 318357 | |
| Charged Off | 77673 | 19.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| fully | 318357 | |
| paid | 318357 | |
| charged | 77673 | 9.8% |
| off | 77673 | 9.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 636714 | |
| 396030 | ||
| a | 396030 | |
| d | 396030 | |
| y | 318357 | |
| u | 318357 | |
| P | 318357 | |
| F | 318357 | |
| i | 318357 | |
| f | 155346 | 3.8% |
| Other values (6) | 466038 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4037973 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 636714 | |
| 396030 | ||
| a | 396030 | |
| d | 396030 | |
| y | 318357 | |
| u | 318357 | |
| P | 318357 | |
| F | 318357 | |
| i | 318357 | |
| f | 155346 | 3.8% |
| Other values (6) | 466038 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4037973 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 636714 | |
| 396030 | ||
| a | 396030 | |
| d | 396030 | |
| y | 318357 | |
| u | 318357 | |
| P | 318357 | |
| F | 318357 | |
| i | 318357 | |
| f | 155346 | 3.8% |
| Other values (6) | 466038 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4037973 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 636714 | |
| 396030 | ||
| a | 396030 | |
| d | 396030 | |
| y | 318357 | |
| u | 318357 | |
| P | 318357 | |
| F | 318357 | |
| i | 318357 | |
| f | 155346 | 3.8% |
| Other values (6) | 466038 |
purpose
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.2 MiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| home_improvement | |
| other | 21185 |
| major_purchase | 8790 |
| Other values (9) |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 14.997846 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5939597 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | vacation |
|---|---|
| 2nd row | debt_consolidation |
| 3rd row | credit_card |
| 4th row | credit_card |
| 5th row | credit_card |
Common Values
| Value | Count | Frequency (%) |
| debt_consolidation | 234507 | |
| credit_card | 83019 | 21.0% |
| home_improvement | 24030 | 6.1% |
| other | 21185 | 5.3% |
| major_purchase | 8790 | 2.2% |
| small_business | 5701 | 1.4% |
| car | 4697 | 1.2% |
| medical | 4196 | 1.1% |
| moving | 2854 | 0.7% |
| vacation | 2452 | 0.6% |
| Other values (4) | 4599 | 1.2% |
Length
| Value | Count | Frequency (%) |
| debt_consolidation | 234507 | |
| credit_card | 83019 | 21.0% |
| home_improvement | 24030 | 6.1% |
| other | 21185 | 5.3% |
| major_purchase | 8790 | 2.2% |
| small_business | 5701 | 1.4% |
| car | 4697 | 1.2% |
| medical | 4196 | 1.1% |
| moving | 2854 | 0.7% |
| vacation | 2452 | 0.6% |
| Other values (4) | 4599 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 789320 | |
| d | 643129 | |
| t | 599957 | |
| i | 593335 | |
| n | 506778 | |
| e | 435403 | |
| c | 420937 | |
| _ | 356376 | 6.0% |
| a | 355447 | 6.0% |
| s | 268302 | 4.5% |
| Other values (12) | 970613 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5939597 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 789320 | |
| d | 643129 | |
| t | 599957 | |
| i | 593335 | |
| n | 506778 | |
| e | 435403 | |
| c | 420937 | |
| _ | 356376 | 6.0% |
| a | 355447 | 6.0% |
| s | 268302 | 4.5% |
| Other values (12) | 970613 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5939597 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 789320 | |
| d | 643129 | |
| t | 599957 | |
| i | 593335 | |
| n | 506778 | |
| e | 435403 | |
| c | 420937 | |
| _ | 356376 | 6.0% |
| a | 355447 | 6.0% |
| s | 268302 | 4.5% |
| Other values (12) | 970613 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5939597 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 789320 | |
| d | 643129 | |
| t | 599957 | |
| i | 593335 | |
| n | 506778 | |
| e | 435403 | |
| c | 420937 | |
| _ | 356376 | 6.0% |
| a | 355447 | 6.0% |
| s | 268302 | 4.5% |
| Other values (12) | 970613 |
title
Text
| Distinct | 48816 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 1756 |
| Missing (%) | 0.4% |
| Memory size | 25.0 MiB |
Length
| Max length | 80 |
|---|---|
| Median length | 79 |
| Mean length | 17.241127 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6797728 |
|---|---|
| Distinct characters | 101 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 41797 ? |
|---|---|
| Unique (%) | 10.6% |
Sample
| 1st row | Vacation |
|---|---|
| 2nd row | Debt consolidation |
| 3rd row | Credit card refinancing |
| 4th row | Credit card refinancing |
| 5th row | Credit Card Refinance |
| Value | Count | Frequency (%) |
| consolidation | 191014 | |
| debt | 190821 | |
| credit | 74290 | 8.4% |
| card | 68254 | 7.7% |
| refinancing | 52262 | 5.9% |
| loan | 28112 | 3.2% |
| home | 22625 | 2.6% |
| improvement | 18786 | 2.1% |
| other | 13252 | 1.5% |
| payoff | 6685 | 0.8% |
| Other values (14633) | 216173 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 735790 | |
| n | 682850 | 10.0% |
| i | 655694 | 9.6% |
| t | 545268 | 8.0% |
| e | 521003 | 7.7% |
| 494561 | 7.3% | |
| a | 445747 | 6.6% |
| d | 386101 | 5.7% |
| c | 322828 | 4.7% |
| r | 295630 | 4.3% |
| Other values (91) | 1712256 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6797728 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 735790 | |
| n | 682850 | 10.0% |
| i | 655694 | 9.6% |
| t | 545268 | 8.0% |
| e | 521003 | 7.7% |
| 494561 | 7.3% | |
| a | 445747 | 6.6% |
| d | 386101 | 5.7% |
| c | 322828 | 4.7% |
| r | 295630 | 4.3% |
| Other values (91) | 1712256 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6797728 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 735790 | |
| n | 682850 | 10.0% |
| i | 655694 | 9.6% |
| t | 545268 | 8.0% |
| e | 521003 | 7.7% |
| 494561 | 7.3% | |
| a | 445747 | 6.6% |
| d | 386101 | 5.7% |
| c | 322828 | 4.7% |
| r | 295630 | 4.3% |
| Other values (91) | 1712256 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6797728 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 735790 | |
| n | 682850 | 10.0% |
| i | 655694 | 9.6% |
| t | 545268 | 8.0% |
| e | 521003 | 7.7% |
| 494561 | 7.3% | |
| a | 445747 | 6.6% |
| d | 386101 | 5.7% |
| c | 322828 | 4.7% |
| r | 295630 | 4.3% |
| Other values (91) | 1712256 |
dti
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 4262 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.379514 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 313 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.68 |
| Q1 | 11.28 |
| median | 16.91 |
| Q3 | 22.98 |
| 95-th percentile | 31.58 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 11.7 |
Descriptive statistics
| Standard deviation | 18.019092 |
|---|---|
| Coefficient of variation (CV) | 1.0368007 |
| Kurtosis | 237923.68 |
| Mean | 17.379514 |
| Median Absolute Deviation (MAD) | 5.83 |
| Skewness | 431.05123 |
| Sum | 6882808.8 |
| Variance | 324.68769 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 313 | 0.1% |
| 14.4 | 310 | 0.1% |
| 19.2 | 302 | 0.1% |
| 16.8 | 301 | 0.1% |
| 18 | 300 | 0.1% |
| 20.4 | 296 | 0.1% |
| 12 | 293 | 0.1% |
| 13.2 | 291 | 0.1% |
| 21.6 | 270 | 0.1% |
| 15.6 | 266 | 0.1% |
| Other values (4252) | 393088 |
| Value | Count | Frequency (%) |
| 0 | 313 | |
| 0.01 | 8 | < 0.1% |
| 0.02 | 12 | < 0.1% |
| 0.03 | 5 | < 0.1% |
| 0.04 | 5 | < 0.1% |
| 0.05 | 6 | < 0.1% |
| 0.06 | 7 | < 0.1% |
| 0.07 | 7 | < 0.1% |
| 0.08 | 8 | < 0.1% |
| 0.09 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 1 | |
| 1622 | 1 | |
| 380.53 | 1 | |
| 189.9 | 1 | |
| 145.65 | 1 | |
| 138.03 | 1 | |
| 120.66 | 1 | |
| 107.55 | 1 | |
| 93.86 | 1 | |
| 92.13 | 1 |
earliest_cr_line
Date
| Distinct | 684 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Minimum | 1944-01-01 00:00:00 |
|---|---|
| Maximum | 2013-10-01 00:00:00 |
open_acc
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.311153 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 10 |
| Q3 | 14 |
| 95-th percentile | 21 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.1376488 |
|---|---|
| Coefficient of variation (CV) | 0.45421088 |
| Kurtosis | 2.9669448 |
| Mean | 11.311153 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.2130188 |
| Sum | 4479556 |
| Variance | 26.395435 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 36779 | 9.3% |
| 10 | 35441 | 8.9% |
| 8 | 35137 | 8.9% |
| 11 | 32695 | 8.3% |
| 7 | 31328 | 7.9% |
| 12 | 29157 | 7.4% |
| 6 | 25927 | 6.5% |
| 13 | 24983 | 6.3% |
| 14 | 21173 | 5.3% |
| 5 | 18308 | 4.6% |
| Other values (51) | 105102 |
| Value | Count | Frequency (%) |
| 0 | 6 | < 0.1% |
| 1 | 85 | < 0.1% |
| 2 | 1459 | 0.4% |
| 3 | 4783 | 1.2% |
| 4 | 10709 | 2.7% |
| 5 | 18308 | |
| 6 | 25927 | |
| 7 | 31328 | |
| 8 | 35137 | |
| 9 | 36779 |
| Value | Count | Frequency (%) |
| 90 | 1 | < 0.1% |
| 76 | 2 | < 0.1% |
| 58 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 56 | 2 | < 0.1% |
| 55 | 2 | < 0.1% |
| 54 | 3 | |
| 53 | 6 | |
| 52 | 3 | |
| 51 | 4 |
pub_rec
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.17819105 |
| Minimum | 0 |
|---|---|
| Maximum | 86 |
| Zeros | 338272 |
| Zeros (%) | 85.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 86 |
| Range | 86 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5306706 |
|---|---|
| Coefficient of variation (CV) | 2.9780991 |
| Kurtosis | 1867.4666 |
| Mean | 0.17819105 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.576564 |
| Sum | 70569 |
| Variance | 0.28161129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 338272 | |
| 1 | 49739 | 12.6% |
| 2 | 5476 | 1.4% |
| 3 | 1521 | 0.4% |
| 4 | 527 | 0.1% |
| 5 | 237 | 0.1% |
| 6 | 122 | < 0.1% |
| 7 | 56 | < 0.1% |
| 8 | 34 | < 0.1% |
| 9 | 12 | < 0.1% |
| Other values (10) | 34 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 338272 | |
| 1 | 49739 | 12.6% |
| 2 | 5476 | 1.4% |
| 3 | 1521 | 0.4% |
| 4 | 527 | 0.1% |
| 5 | 237 | 0.1% |
| 6 | 122 | < 0.1% |
| 7 | 56 | < 0.1% |
| 8 | 34 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 86 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 17 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 13 | 4 | < 0.1% |
| 12 | 4 | < 0.1% |
| 11 | 8 | |
| 10 | 11 |
revol_bal
Real number (ℝ)
| Distinct | 55622 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15844.54 |
| Minimum | 0 |
|---|---|
| Maximum | 1743266 |
| Zeros | 2128 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1685 |
| Q1 | 6025 |
| median | 11181 |
| Q3 | 19620 |
| 95-th percentile | 41066.55 |
| Maximum | 1743266 |
| Range | 1743266 |
| Interquartile range (IQR) | 13595 |
Descriptive statistics
| Standard deviation | 20591.836 |
|---|---|
| Coefficient of variation (CV) | 1.2996172 |
| Kurtosis | 384.22109 |
| Mean | 15844.54 |
| Median Absolute Deviation (MAD) | 6112 |
| Skewness | 11.727515 |
| Sum | 6.2749131 × 109 |
| Variance | 4.2402371 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2128 | 0.5% |
| 5655 | 41 | < 0.1% |
| 6095 | 38 | < 0.1% |
| 7792 | 38 | < 0.1% |
| 3953 | 37 | < 0.1% |
| 5098 | 36 | < 0.1% |
| 6077 | 36 | < 0.1% |
| 4541 | 35 | < 0.1% |
| 5389 | 35 | < 0.1% |
| 5235 | 35 | < 0.1% |
| Other values (55612) | 393571 |
| Value | Count | Frequency (%) |
| 0 | 2128 | |
| 1 | 30 | < 0.1% |
| 2 | 26 | < 0.1% |
| 3 | 28 | < 0.1% |
| 4 | 20 | < 0.1% |
| 5 | 23 | < 0.1% |
| 6 | 30 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 21 | < 0.1% |
| 9 | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 1743266 | 1 | |
| 1298783 | 1 | |
| 1190046 | 1 | |
| 1030826 | 1 | |
| 1023940 | 1 | |
| 975800 | 1 | |
| 867528 | 1 | |
| 838698 | 1 | |
| 814300 | 1 | |
| 778614 | 1 |
revol_util
Real number (ℝ)
| Distinct | 1226 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 276 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.791749 |
| Minimum | 0 |
|---|---|
| Maximum | 892.3 |
| Zeros | 2213 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11.2 |
| Q1 | 35.8 |
| median | 54.8 |
| Q3 | 72.9 |
| 95-th percentile | 92 |
| Maximum | 892.3 |
| Range | 892.3 |
| Interquartile range (IQR) | 37.1 |
Descriptive statistics
| Standard deviation | 24.452193 |
|---|---|
| Coefficient of variation (CV) | 0.45457145 |
| Kurtosis | 2.7122782 |
| Mean | 53.791749 |
| Median Absolute Deviation (MAD) | 18.5 |
| Skewness | -0.07177802 |
| Sum | 21288300 |
| Variance | 597.90975 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2213 | 0.6% |
| 53 | 752 | 0.2% |
| 60 | 739 | 0.2% |
| 61 | 734 | 0.2% |
| 55 | 730 | 0.2% |
| 54 | 725 | 0.2% |
| 62 | 721 | 0.2% |
| 47 | 720 | 0.2% |
| 57 | 719 | 0.2% |
| 58 | 717 | 0.2% |
| Other values (1216) | 386984 |
| Value | Count | Frequency (%) |
| 0 | 2213 | |
| 0.01 | 1 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.05 | 1 | < 0.1% |
| 0.1 | 253 | 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.2 | 211 | 0.1% |
| 0.3 | 187 | < 0.1% |
| 0.4 | 189 | < 0.1% |
| 0.46 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 892.3 | 1 | |
| 153 | 1 | |
| 152.5 | 1 | |
| 150.7 | 1 | |
| 148 | 1 | |
| 146.1 | 1 | |
| 145.8 | 1 | |
| 140.4 | 1 | |
| 136.7 | 1 | |
| 132.1 | 1 |
total_acc
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 118 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.414744 |
| Minimum | 2 |
|---|---|
| Maximum | 151 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 17 |
| median | 24 |
| Q3 | 32 |
| 95-th percentile | 47 |
| Maximum | 151 |
| Range | 149 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.886991 |
|---|---|
| Coefficient of variation (CV) | 0.46772027 |
| Kurtosis | 1.20462 |
| Mean | 25.414744 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.86432764 |
| Sum | 10065001 |
| Variance | 141.30055 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 14280 | 3.6% |
| 22 | 14260 | 3.6% |
| 20 | 14228 | 3.6% |
| 23 | 13923 | 3.5% |
| 24 | 13878 | 3.5% |
| 19 | 13876 | 3.5% |
| 18 | 13710 | 3.5% |
| 17 | 13495 | 3.4% |
| 25 | 13225 | 3.3% |
| 26 | 12799 | 3.2% |
| Other values (108) | 258356 |
| Value | Count | Frequency (%) |
| 2 | 18 | < 0.1% |
| 3 | 327 | 0.1% |
| 4 | 1238 | 0.3% |
| 5 | 2028 | 0.5% |
| 6 | 2923 | 0.7% |
| 7 | 4143 | |
| 8 | 5365 | |
| 9 | 6362 | |
| 10 | 7672 | |
| 11 | 8844 |
| Value | Count | Frequency (%) |
| 151 | 1 | |
| 150 | 1 | |
| 135 | 1 | |
| 129 | 1 | |
| 124 | 1 | |
| 118 | 1 | |
| 117 | 1 | |
| 116 | 2 | |
| 115 | 1 | |
| 111 | 2 |
initial_list_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.9 MiB |
| f | |
|---|---|
| w |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 396030 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | w |
|---|---|
| 2nd row | f |
| 3rd row | f |
| 4th row | f |
| 5th row | f |
Common Values
| Value | Count | Frequency (%) |
| f | 238066 | |
| w | 157964 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 238066 | |
| w | 157964 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 238066 | |
| w | 157964 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 396030 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| f | 238066 | |
| w | 157964 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 396030 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| f | 238066 | |
| w | 157964 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 396030 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| f | 238066 | |
| w | 157964 |
application_type
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.3 MiB |
| INDIVIDUAL | |
|---|---|
| JOINT | 425 |
| DIRECT_PAY | 286 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9946342 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3958175 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | INDIVIDUAL |
|---|---|
| 2nd row | INDIVIDUAL |
| 3rd row | INDIVIDUAL |
| 4th row | INDIVIDUAL |
| 5th row | INDIVIDUAL |
Common Values
| Value | Count | Frequency (%) |
| INDIVIDUAL | 395319 | |
| JOINT | 425 | 0.1% |
| DIRECT_PAY | 286 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| individual | 395319 | |
| joint | 425 | 0.1% |
| direct_pay | 286 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1186668 | |
| D | 790924 | |
| N | 395744 | 10.0% |
| A | 395605 | 10.0% |
| V | 395319 | 10.0% |
| U | 395319 | 10.0% |
| L | 395319 | 10.0% |
| T | 711 | < 0.1% |
| J | 425 | < 0.1% |
| O | 425 | < 0.1% |
| Other values (6) | 1716 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3958175 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 1186668 | |
| D | 790924 | |
| N | 395744 | 10.0% |
| A | 395605 | 10.0% |
| V | 395319 | 10.0% |
| U | 395319 | 10.0% |
| L | 395319 | 10.0% |
| T | 711 | < 0.1% |
| J | 425 | < 0.1% |
| O | 425 | < 0.1% |
| Other values (6) | 1716 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3958175 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 1186668 | |
| D | 790924 | |
| N | 395744 | 10.0% |
| A | 395605 | 10.0% |
| V | 395319 | 10.0% |
| U | 395319 | 10.0% |
| L | 395319 | 10.0% |
| T | 711 | < 0.1% |
| J | 425 | < 0.1% |
| O | 425 | < 0.1% |
| Other values (6) | 1716 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3958175 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 1186668 | |
| D | 790924 | |
| N | 395744 | 10.0% |
| A | 395605 | 10.0% |
| V | 395319 | 10.0% |
| U | 395319 | 10.0% |
| L | 395319 | 10.0% |
| T | 711 | < 0.1% |
| J | 425 | < 0.1% |
| O | 425 | < 0.1% |
| Other values (6) | 1716 | < 0.1% |
mort_acc
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 37795 |
| Missing (%) | 9.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8139908 |
| Minimum | 0 |
|---|---|
| Maximum | 34 |
| Zeros | 139777 |
| Zeros (%) | 35.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 34 |
| Range | 34 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.1479305 |
|---|---|
| Coefficient of variation (CV) | 1.1840911 |
| Kurtosis | 4.4771757 |
| Mean | 1.8139908 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6001324 |
| Sum | 649835 |
| Variance | 4.6136053 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 139777 | |
| 1 | 60416 | |
| 2 | 49948 | 12.6% |
| 3 | 38049 | 9.6% |
| 4 | 27887 | 7.0% |
| 5 | 18194 | 4.6% |
| 6 | 11069 | 2.8% |
| 7 | 6052 | 1.5% |
| 8 | 3121 | 0.8% |
| 9 | 1656 | 0.4% |
| Other values (23) | 2066 | 0.5% |
| (Missing) | 37795 | 9.5% |
| Value | Count | Frequency (%) |
| 0 | 139777 | |
| 1 | 60416 | |
| 2 | 49948 | 12.6% |
| 3 | 38049 | 9.6% |
| 4 | 27887 | 7.0% |
| 5 | 18194 | 4.6% |
| 6 | 11069 | 2.8% |
| 7 | 6052 | 1.5% |
| 8 | 3121 | 0.8% |
| 9 | 1656 | 0.4% |
| Value | Count | Frequency (%) |
| 34 | 1 | < 0.1% |
| 32 | 2 | < 0.1% |
| 31 | 2 | < 0.1% |
| 30 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 3 | < 0.1% |
| 26 | 2 | < 0.1% |
| 25 | 4 | < 0.1% |
| 24 | 10 | |
| 23 | 2 | < 0.1% |
pub_rec_bankruptcies
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 535 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12164756 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 350380 |
| Zeros (%) | 88.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.35617428 |
|---|---|
| Coefficient of variation (CV) | 2.9279197 |
| Kurtosis | 18.10416 |
| Mean | 0.12164756 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4234404 |
| Sum | 48111 |
| Variance | 0.12686012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 350380 | |
| 1 | 42790 | 10.8% |
| 2 | 1847 | 0.5% |
| 3 | 351 | 0.1% |
| 4 | 82 | < 0.1% |
| 5 | 32 | < 0.1% |
| 6 | 7 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| (Missing) | 535 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 350380 | |
| 1 | 42790 | 10.8% |
| 2 | 1847 | 0.5% |
| 3 | 351 | 0.1% |
| 4 | 82 | < 0.1% |
| 5 | 32 | < 0.1% |
| 6 | 7 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 7 | < 0.1% |
| 5 | 32 | < 0.1% |
| 4 | 82 | < 0.1% |
| 3 | 351 | 0.1% |
| 2 | 1847 | 0.5% |
| 1 | 42790 | 10.8% |
| 0 | 350380 |
address
Text
| Distinct | 393700 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.4 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 60 |
| Mean length | 44.713951 |
| Min length | 20 |
Characters and Unicode
| Total characters | 17708066 |
|---|---|
| Distinct characters | 67 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 391984 ? |
|---|---|
| Unique (%) | 99.0% |
Sample
| 1st row | 0174 Michelle Gateway Mendozaberg, OK 22690 |
|---|---|
| 2nd row | 1076 Carney Fort Apt. 347 Loganmouth, SD 05113 |
| 3rd row | 87025 Mark Dale Apt. 269 New Sabrina, WV 05113 |
| 4th row | 823 Reid Ford Delacruzside, MA 00813 |
| 5th row | 679 Luna Roads Greggshire, VA 11650 |
| Value | Count | Frequency (%) |
| suite | 88417 | 3.0% |
| apt | 88400 | 3.0% |
| 70466 | 56986 | 2.0% |
| 30723 | 56548 | 1.9% |
| 22690 | 56527 | 1.9% |
| 48052 | 55920 | 1.9% |
| 00813 | 45826 | 1.6% |
| 29597 | 45472 | 1.6% |
| 05113 | 45403 | 1.6% |
| box | 28349 | 1.0% |
| Other values (108604) | 2352838 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2128626 | 12.0% | |
| e | 911545 | 5.1% |
| a | 735427 | 4.2% |
| t | 702787 | 4.0% |
| r | 656748 | 3.7% |
| 0 | 624825 | 3.5% |
| i | 580043 | 3.3% |
| o | 579480 | 3.3% |
| n | 551350 | 3.1% |
| 2 | 487525 | 2.8% |
| Other values (57) | 9749710 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17708066 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2128626 | 12.0% | |
| e | 911545 | 5.1% |
| a | 735427 | 4.2% |
| t | 702787 | 4.0% |
| r | 656748 | 3.7% |
| 0 | 624825 | 3.5% |
| i | 580043 | 3.3% |
| o | 579480 | 3.3% |
| n | 551350 | 3.1% |
| 2 | 487525 | 2.8% |
| Other values (57) | 9749710 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17708066 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2128626 | 12.0% | |
| e | 911545 | 5.1% |
| a | 735427 | 4.2% |
| t | 702787 | 4.0% |
| r | 656748 | 3.7% |
| 0 | 624825 | 3.5% |
| i | 580043 | 3.3% |
| o | 579480 | 3.3% |
| n | 551350 | 3.1% |
| 2 | 487525 | 2.8% |
| Other values (57) | 9749710 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17708066 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2128626 | 12.0% | |
| e | 911545 | 5.1% |
| a | 735427 | 4.2% |
| t | 702787 | 4.0% |
| r | 656748 | 3.7% |
| 0 | 624825 | 3.5% |
| i | 580043 | 3.3% |
| o | 579480 | 3.3% |
| n | 551350 | 3.1% |
| 2 | 487525 | 2.8% |
| Other values (57) | 9749710 |
| annual_inc | application_type | dti | emp_length | grade | home_ownership | initial_list_status | installment | int_rate | loan_amnt | loan_status | mort_acc | open_acc | pub_rec | pub_rec_bankruptcies | purpose | revol_bal | revol_util | sub_grade | term | total_acc | verification_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| annual_inc | 1.000 | 0.000 | -0.203 | 0.000 | 0.001 | 0.000 | 0.003 | 0.470 | -0.097 | 0.489 | 0.003 | 0.379 | 0.240 | -0.046 | -0.072 | 0.002 | 0.393 | 0.060 | 0.000 | 0.000 | 0.334 | 0.005 |
| application_type | 0.000 | 1.000 | 0.048 | 0.003 | 0.030 | 0.011 | 0.027 | 0.015 | 0.049 | 0.024 | 0.012 | 0.007 | 0.012 | 0.000 | 0.002 | 0.003 | 0.000 | 0.000 | 0.036 | 0.015 | 0.009 | 0.014 |
| dti | -0.203 | 0.048 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.056 | 0.172 | 0.053 | 0.002 | -0.048 | 0.323 | -0.042 | -0.033 | 0.000 | 0.250 | 0.185 | 0.000 | 0.000 | 0.237 | 0.000 |
| emp_length | 0.000 | 0.003 | 1.000 | 1.000 | 0.005 | 0.095 | 0.043 | 0.033 | 0.007 | 0.036 | 0.017 | 0.056 | 0.018 | 0.004 | 0.016 | 0.027 | 0.005 | 0.011 | 0.005 | 0.062 | 0.046 | 0.053 |
| grade | 0.001 | 0.030 | 0.000 | 0.005 | 1.000 | 0.040 | 0.056 | 0.104 | 0.721 | 0.097 | 0.258 | 0.029 | 0.016 | 0.004 | 0.038 | 0.089 | 0.002 | 0.098 | 1.000 | 0.468 | 0.028 | 0.160 |
| home_ownership | 0.000 | 0.011 | 0.000 | 0.095 | 0.040 | 1.000 | 0.048 | 0.071 | 0.044 | 0.084 | 0.068 | 0.138 | 0.060 | 0.000 | 0.005 | 0.086 | 0.015 | 0.014 | 0.047 | 0.100 | 0.104 | 0.046 |
| initial_list_status | 0.003 | 0.027 | 0.000 | 0.043 | 0.056 | 0.048 | 1.000 | 0.058 | 0.066 | 0.082 | 0.009 | 0.017 | 0.065 | 0.000 | 0.041 | 0.082 | 0.012 | 0.029 | 0.066 | 0.105 | 0.065 | 0.089 |
| installment | 0.470 | 0.015 | 0.056 | 0.033 | 0.104 | 0.071 | 0.058 | 1.000 | 0.137 | 0.968 | 0.057 | 0.202 | 0.208 | -0.093 | -0.103 | 0.097 | 0.460 | 0.132 | 0.093 | 0.257 | 0.217 | 0.217 |
| int_rate | -0.097 | 0.049 | 0.172 | 0.007 | 0.721 | 0.044 | 0.066 | 0.137 | 1.000 | 0.131 | 0.246 | -0.103 | 0.004 | 0.072 | 0.061 | 0.071 | 0.006 | 0.304 | 0.721 | 0.441 | -0.051 | 0.167 |
| loan_amnt | 0.489 | 0.024 | 0.053 | 0.036 | 0.097 | 0.084 | 0.082 | 0.968 | 0.131 | 1.000 | 0.065 | 0.231 | 0.215 | -0.100 | -0.109 | 0.100 | 0.470 | 0.105 | 0.084 | 0.411 | 0.237 | 0.234 |
| loan_status | 0.003 | 0.012 | 0.002 | 0.017 | 0.258 | 0.068 | 0.009 | 0.057 | 0.246 | 0.065 | 1.000 | 0.055 | 0.028 | 0.006 | 0.010 | 0.059 | 0.008 | 0.039 | 0.264 | 0.173 | 0.020 | 0.086 |
| mort_acc | 0.379 | 0.007 | -0.048 | 0.056 | 0.029 | 0.138 | 0.017 | 0.202 | -0.103 | 0.231 | 0.055 | 1.000 | 0.142 | 0.032 | 0.040 | 0.021 | 0.239 | 0.008 | 0.025 | 0.069 | 0.405 | 0.049 |
| open_acc | 0.240 | 0.012 | 0.323 | 0.018 | 0.016 | 0.060 | 0.065 | 0.208 | 0.004 | 0.215 | 0.028 | 0.142 | 1.000 | -0.019 | -0.025 | 0.036 | 0.364 | -0.139 | 0.016 | 0.076 | 0.672 | 0.044 |
| pub_rec | -0.046 | 0.000 | -0.042 | 0.004 | 0.004 | 0.000 | 0.000 | -0.093 | 0.072 | -0.100 | 0.006 | 0.032 | -0.019 | 1.000 | 0.862 | 0.007 | -0.209 | -0.095 | 0.006 | 0.001 | 0.033 | 0.004 |
| pub_rec_bankruptcies | -0.072 | 0.002 | -0.033 | 0.016 | 0.038 | 0.005 | 0.041 | -0.103 | 0.061 | -0.109 | 0.010 | 0.040 | -0.025 | 0.862 | 1.000 | 0.014 | -0.205 | -0.091 | 0.036 | 0.020 | 0.042 | 0.032 |
| purpose | 0.002 | 0.003 | 0.000 | 0.027 | 0.089 | 0.086 | 0.082 | 0.097 | 0.071 | 0.100 | 0.059 | 0.021 | 0.036 | 0.007 | 0.014 | 1.000 | 0.004 | 0.026 | 0.063 | 0.088 | 0.034 | 0.062 |
| revol_bal | 0.393 | 0.000 | 0.250 | 0.005 | 0.002 | 0.015 | 0.012 | 0.460 | 0.006 | 0.470 | 0.008 | 0.239 | 0.364 | -0.209 | -0.205 | 0.004 | 1.000 | 0.420 | 0.000 | 0.006 | 0.294 | 0.014 |
| revol_util | 0.060 | 0.000 | 0.185 | 0.011 | 0.098 | 0.014 | 0.029 | 0.132 | 0.304 | 0.105 | 0.039 | 0.008 | -0.139 | -0.095 | -0.091 | 0.026 | 0.420 | 1.000 | 0.101 | 0.022 | -0.105 | 0.039 |
| sub_grade | 0.000 | 0.036 | 0.000 | 0.005 | 1.000 | 0.047 | 0.066 | 0.093 | 0.721 | 0.084 | 0.264 | 0.025 | 0.016 | 0.006 | 0.036 | 0.063 | 0.000 | 0.101 | 1.000 | 0.481 | 0.025 | 0.169 |
| term | 0.000 | 0.015 | 0.000 | 0.062 | 0.468 | 0.100 | 0.105 | 0.257 | 0.441 | 0.411 | 0.173 | 0.069 | 0.076 | 0.001 | 0.020 | 0.088 | 0.006 | 0.022 | 0.481 | 1.000 | 0.101 | 0.218 |
| total_acc | 0.334 | 0.009 | 0.237 | 0.046 | 0.028 | 0.104 | 0.065 | 0.217 | -0.051 | 0.237 | 0.020 | 0.405 | 0.672 | 0.033 | 0.042 | 0.034 | 0.294 | -0.105 | 0.025 | 0.101 | 1.000 | 0.061 |
| verification_status | 0.005 | 0.014 | 0.000 | 0.053 | 0.160 | 0.046 | 0.089 | 0.217 | 0.167 | 0.234 | 0.086 | 0.049 | 0.044 | 0.004 | 0.032 | 0.062 | 0.014 | 0.039 | 0.169 | 0.218 | 0.061 | 1.000 |
| loan_amnt | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | purpose | title | dti | earliest_cr_line | open_acc | pub_rec | revol_bal | revol_util | total_acc | initial_list_status | application_type | mort_acc | pub_rec_bankruptcies | address | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10000.0 | 36 months | 11.44 | 329.48 | B | B4 | Marketing | 10+ years | RENT | 117000.0 | Not Verified | 2015-01-01 | Fully Paid | vacation | Vacation | 26.24 | 1990-06-01 | 16.0 | 0.0 | 36369.0 | 41.8 | 25.0 | w | INDIVIDUAL | 0.0 | 0.0 | 0174 Michelle Gateway\r\nMendozaberg, OK 22690 |
| 1 | 8000.0 | 36 months | 11.99 | 265.68 | B | B5 | Credit analyst | 4 years | MORTGAGE | 65000.0 | Not Verified | 2015-01-01 | Fully Paid | debt_consolidation | Debt consolidation | 22.05 | 2004-07-01 | 17.0 | 0.0 | 20131.0 | 53.3 | 27.0 | f | INDIVIDUAL | 3.0 | 0.0 | 1076 Carney Fort Apt. 347\r\nLoganmouth, SD 05113 |
| 2 | 15600.0 | 36 months | 10.49 | 506.97 | B | B3 | Statistician | < 1 year | RENT | 43057.0 | Source Verified | 2015-01-01 | Fully Paid | credit_card | Credit card refinancing | 12.79 | 2007-08-01 | 13.0 | 0.0 | 11987.0 | 92.2 | 26.0 | f | INDIVIDUAL | 0.0 | 0.0 | 87025 Mark Dale Apt. 269\r\nNew Sabrina, WV 05113 |
| 3 | 7200.0 | 36 months | 6.49 | 220.65 | A | A2 | Client Advocate | 6 years | RENT | 54000.0 | Not Verified | 2014-11-01 | Fully Paid | credit_card | Credit card refinancing | 2.60 | 2006-09-01 | 6.0 | 0.0 | 5472.0 | 21.5 | 13.0 | f | INDIVIDUAL | 0.0 | 0.0 | 823 Reid Ford\r\nDelacruzside, MA 00813 |
| 4 | 24375.0 | 60 months | 17.27 | 609.33 | C | C5 | Destiny Management Inc. | 9 years | MORTGAGE | 55000.0 | Verified | 2013-04-01 | Charged Off | credit_card | Credit Card Refinance | 33.95 | 1999-03-01 | 13.0 | 0.0 | 24584.0 | 69.8 | 43.0 | f | INDIVIDUAL | 1.0 | 0.0 | 679 Luna Roads\r\nGreggshire, VA 11650 |
| 5 | 20000.0 | 36 months | 13.33 | 677.07 | C | C3 | HR Specialist | 10+ years | MORTGAGE | 86788.0 | Verified | 2015-09-01 | Fully Paid | debt_consolidation | Debt consolidation | 16.31 | 2005-01-01 | 8.0 | 0.0 | 25757.0 | 100.6 | 23.0 | f | INDIVIDUAL | 4.0 | 0.0 | 1726 Cooper Passage Suite 129\r\nNorth Deniseberg, DE 30723 |
| 6 | 18000.0 | 36 months | 5.32 | 542.07 | A | A1 | Software Development Engineer | 2 years | MORTGAGE | 125000.0 | Source Verified | 2015-09-01 | Fully Paid | home_improvement | Home improvement | 1.36 | 2005-08-01 | 8.0 | 0.0 | 4178.0 | 4.9 | 25.0 | f | INDIVIDUAL | 3.0 | 0.0 | 1008 Erika Vista Suite 748\r\nEast Stephanie, TX 22690 |
| 7 | 13000.0 | 36 months | 11.14 | 426.47 | B | B2 | Office Depot | 10+ years | RENT | 46000.0 | Not Verified | 2012-09-01 | Fully Paid | credit_card | No More Credit Cards | 26.87 | 1994-09-01 | 11.0 | 0.0 | 13425.0 | 64.5 | 15.0 | f | INDIVIDUAL | 0.0 | 0.0 | USCGC Nunez\r\nFPO AE 30723 |
| 8 | 18900.0 | 60 months | 10.99 | 410.84 | B | B3 | Application Architect | 10+ years | RENT | 103000.0 | Verified | 2014-10-01 | Fully Paid | debt_consolidation | Debt consolidation | 12.52 | 1994-06-01 | 13.0 | 0.0 | 18637.0 | 32.9 | 40.0 | w | INDIVIDUAL | 3.0 | 0.0 | USCGC Tran\r\nFPO AP 22690 |
| 9 | 26300.0 | 36 months | 16.29 | 928.40 | C | C5 | Regado Biosciences | 3 years | MORTGAGE | 115000.0 | Verified | 2012-04-01 | Fully Paid | debt_consolidation | Debt Consolidation | 23.69 | 1997-12-01 | 13.0 | 0.0 | 22171.0 | 82.4 | 37.0 | f | INDIVIDUAL | 1.0 | 0.0 | 3390 Luis Rue\r\nMauricestad, VA 00813 |
| loan_amnt | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | purpose | title | dti | earliest_cr_line | open_acc | pub_rec | revol_bal | revol_util | total_acc | initial_list_status | application_type | mort_acc | pub_rec_bankruptcies | address | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 396020 | 10000.0 | 36 months | 9.76 | 321.55 | B | B3 | Retirement Counselor | 10+ years | RENT | 40000.0 | Not Verified | 2015-12-01 | Fully Paid | debt_consolidation | Debt consolidation | 23.40 | 1988-01-01 | 9.0 | 0.0 | 8819.0 | 57.3 | 18.0 | w | INDIVIDUAL | 1.0 | 0.0 | 914 Alexander Mountains Apt. 604\r\nEast Marco, VT 70466 |
| 396021 | 3200.0 | 36 months | 5.42 | 96.52 | A | A1 | St Francis Medical Center | 10+ years | RENT | 33000.0 | Not Verified | 2011-02-01 | Fully Paid | debt_consolidation | 2011 Insurance and Debt Consolidation | 21.45 | 1996-11-01 | 18.0 | 0.0 | 3985.0 | 7.6 | 50.0 | f | INDIVIDUAL | NaN | 0.0 | 309 John Mission\r\nWest Marc, NY 00813 |
| 396022 | 12000.0 | 36 months | 12.29 | 400.24 | C | C1 | Data Center Specialist II | 1 year | RENT | 52100.0 | Source Verified | 2015-10-01 | Fully Paid | debt_consolidation | Debt consolidation | 17.28 | 2004-10-01 | 6.0 | 0.0 | 9580.0 | 66.1 | 18.0 | w | INDIVIDUAL | 0.0 | 0.0 | 532 Johnson Drive Apt. 185\r\nAndersonside, NY 70466 |
| 396023 | 22000.0 | 36 months | 18.92 | 805.55 | D | D4 | Operations Manager | 10+ years | MORTGAGE | 138000.0 | Not Verified | 2014-04-01 | Fully Paid | debt_consolidation | Debt consolidation | 24.43 | 1998-05-01 | 18.0 | 0.0 | 22287.0 | 50.4 | 39.0 | f | INDIVIDUAL | 4.0 | 0.0 | 0297 Flores Dale Suite 441\r\nTaylorland, MD 05113 |
| 396024 | 6000.0 | 36 months | 13.11 | 202.49 | B | B4 | Michael's Arts & Crafts | 5 years | RENT | 64000.0 | Not Verified | 2013-03-01 | Fully Paid | debt_consolidation | Credit buster | 10.81 | 1991-11-01 | 7.0 | 0.0 | 11456.0 | 97.1 | 9.0 | w | INDIVIDUAL | 0.0 | 0.0 | 514 Cynthia Park Apt. 402\r\nWest Williamside, SC 05113 |
| 396025 | 10000.0 | 60 months | 10.99 | 217.38 | B | B4 | licensed bankere | 2 years | RENT | 40000.0 | Source Verified | 2015-10-01 | Fully Paid | debt_consolidation | Debt consolidation | 15.63 | 2004-11-01 | 6.0 | 0.0 | 1990.0 | 34.3 | 23.0 | w | INDIVIDUAL | 0.0 | 0.0 | 12951 Williams Crossing\r\nJohnnyville, DC 30723 |
| 396026 | 21000.0 | 36 months | 12.29 | 700.42 | C | C1 | Agent | 5 years | MORTGAGE | 110000.0 | Source Verified | 2015-02-01 | Fully Paid | debt_consolidation | Debt consolidation | 21.45 | 2006-02-01 | 6.0 | 0.0 | 43263.0 | 95.7 | 8.0 | f | INDIVIDUAL | 1.0 | 0.0 | 0114 Fowler Field Suite 028\r\nRachelborough, LA 05113 |
| 396027 | 5000.0 | 36 months | 9.99 | 161.32 | B | B1 | City Carrier | 10+ years | RENT | 56500.0 | Verified | 2013-10-01 | Fully Paid | debt_consolidation | pay off credit cards | 17.56 | 1997-03-01 | 15.0 | 0.0 | 32704.0 | 66.9 | 23.0 | f | INDIVIDUAL | 0.0 | 0.0 | 953 Matthew Points Suite 414\r\nReedfort, NY 70466 |
| 396028 | 21000.0 | 60 months | 15.31 | 503.02 | C | C2 | Gracon Services, Inc | 10+ years | MORTGAGE | 64000.0 | Verified | 2012-08-01 | Fully Paid | debt_consolidation | Loanforpayoff | 15.88 | 1990-11-01 | 9.0 | 0.0 | 15704.0 | 53.8 | 20.0 | f | INDIVIDUAL | 5.0 | 0.0 | 7843 Blake Freeway Apt. 229\r\nNew Michael, FL 29597 |
| 396029 | 2000.0 | 36 months | 13.61 | 67.98 | C | C2 | Internal Revenue Service | 10+ years | RENT | 42996.0 | Verified | 2010-06-01 | Fully Paid | debt_consolidation | Toxic Debt Payoff | 8.32 | 1998-09-01 | 3.0 | 0.0 | 4292.0 | 91.3 | 19.0 | f | INDIVIDUAL | NaN | 0.0 | 787 Michelle Causeway\r\nBriannaton, AR 48052 |